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1,0  Introduction 


Establishing  Tools  for  Computing  Hybrids  (ETCH)  provides  a  starting  point  from  which 
to  explore  how  advanced  computing  technology  might  enable  intelligent  systems  and 
provide  new  capabilities  to  the  Air  Eorce.  One  research  thrust  of  the  Advanced 
Computing  Architectures  Eocus  Area  within  the  Air  Eorce  Research  Eaboratory’s 
Information  Directorate  (IE)  examines  novel  information  processing  paradigms  for  new, 
radical  methods  of  processing  information.  [5  3]  This  research  encompasses 
nanotechnology,  biotechnology  and  the  quantum  sciences.  The  scientists  and  engineers 
conducting  the  research  are  looking  for  the  means  to  either  conduct  existing 
computational  tasks  better  or  address  computational  problems  that  classical  computing 
has  difficulty  solving.  Classical  computing  for  this  project  was  viewed  as  silicon  based 
computing  utilized  in  general  purpose  computing.  Examples  of  computing  that  would  be 
referred  to  as  non-classical  are  quantum  and  biomolecular  computing.  Within  the 
Advanced  Computing  Technology  Branch  (lETC)  of  the  Information  Directorate  the 
interest  in  bio-molecular  computing  and  quantum  computing  is  pursued  under  the 
perspective  that  biomolecular  and  quantum  computing  do  not  represent  technologies  that 
will  replace  general-purpose  silicon  computing,  but  represent  complementary 
technologies.  The  common  viewpoint  is  that  as  these  technologies  mature  they  will  be 
realized  through  the  creation  of  a  hybrid  architecture  that  integrates  a  combination  of 
biotechnology,  quantum  science,  nanotechnology,  and  general  purpose  silicon-based 
computing  technology. 

Over  the  past  several  years,  members  of  the  Information  Technology  Division  have 
supported  several  Defense  Advanced  Research  Projects  Agency  (DARPA)  Programs  that 
fall  under  the  broad  category  of  biotechnology.  These  programs  were:  Bio-Eluidic  Chips 
(BioElips)  that  demonstrated  the  integration  of  biofluidics  with  electronics.  Simulation  of 
Biological  Systems  (SIMBIOSYS)  which  expanded  the  capability  of  multiphysics  design 
tools,  and  the  Bio-Computation  (BioCOMP)  Program  which  sought  to,  “...explore, 
develop  and  exploit  both  the  role  computation  plays  in  biology  and  the  role  biology  plays 
in  computation.”[25]  Unfortunately,  the  role  biology  plays  in  computation;  a  primary 
interested  for  the  IP’s  involvement  in  the  BioCOMP  Program  was  terminated  early  due  to 
budget  cuts.  A  majority  of  the  technology  demonstrations  for  these  programs  focused  on 
the  health  and  medical  fields  since  they  hold  the  strongest  promise  of  future  funding  to 
develop  the  technology  further.  The  results  of  these  programs  will  be  leveraged  in 
demonstrating  how  biotechnology  can  provide  new  capabilities  for  information 
dominance.  Before  the  capability  of  a  biologically-based  information  system  concept  can 
be  demonstrated,  however,  more  research  is  required  to  develop  the  technology  to  the 
level  of  maturity  commensurate  with  practical  application.  This  project  examined  the 
following  three  questions: 

a.  Can  IP  leverage  biotechnology  for  information  dominance? 

b.  If  so,  can  computational  biology  tools  such  as  those  incorporated  into  Bio- 

SPICE  be  used  by  engineers  to  develop  biotechnology  to  the  point  of  maturity 

that  would  advocate  integration  into  future  information  systems? 
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c.  Can  Bio-SPICE  in  particular  be  used  to  unleash  the  potential  of  bioteehnology 
for  information  systems? 


For  the  purposes  of  this  report,  the  following  definitions  were  used  to  eategorize 
reviewed  researeh: 

Biomoleeular  Computing  is  eomputing  with  biologieal  material. 

Bio-inspired  eomputing  is  the  mimieking  of  biologieal  proeesses  on  silieon. 

Bioinformaties  is,  “an  interdiseiplinary  field  bringing  together  biology,  eomputer 
seienee,  mathematies,  statisties,  and  information  theory  to  analyze  biologieal  data 
for  interpretation  and  prediotion.”[8]  Basieally,  this  involves  the  applieation  of 
eomputer  seienee  to  proeess  data  for  medieal  and  biologieal  researeh. 

Bioeomputing  is  the  broad  term  that  eovers  the  pursuit  of  eomputing  teehnology 
whieh  eneompasses  biomoleeular  eomputing,  bio-inspired  eomputing,  and 
bioinformaties. 

This  report  begins  by  examining  how  bioteehnology  ean  be  used  in  future  advaneed 
information  systems.  It  then  examines  eomputational  biology  tools,  sueh  as  those  found 
in  the  Bio-SPICE  open  souree  software  environment,  whieh  may  be  used  to  develop 
bioteehnology.  The  report  then  diseusses  how  Bio-SPICE  eould  be  used  to  develop  the 
bioteehnology  for  information  systems  examining  ideas  for  biomoleeular  eomputing. 
Finally,  teehnology  gaps  related  to  the  development  of  hybrid  eomputing  arehiteetures 
will  be  presented  before  eoneluding.  Prior  to  this  effort,  AFRL/IFTC  had  no  in-house 
experienee  with  eomputational  biology  tools  and  a  limited  knowledge  of  this  elass  tools 
was  obtained  from  managing  efforts  funded  by  DARPA. 


2,0  Biotechnology  for  Advanced  Information  Systems 

In  order  to  better  understand  how  the  Information  Direetorate  eould  benefit  from 
bioteehnology,  an  extensive  literature  seareh  was  eondueted.  Diseussions  were  also  held 
with  various  researehers  during  different  DARPA  Prineipal  Investigator  Meetings, 
eonferenees,  site  meetings,  ete.  to  determine  the  future  direetion  of  bioteehnology. 
Artieles  and  books  were  organized  by  potential  applieation  of  bioteehnology.  The  main 
applieation  eategories  are:  eomputing,  memory,  network  seeurity/management, 
nanofabrieation,  self-assembly/reeonfigurability,  and  bioinformaties. 

2,1  Computing 

Computing  is  one  applieation  area  for  bioteehnology  that  is  of  partieular  interest  for  the 
Advaneed  Computing  Arehiteetures  Foeus  Area  at  the  Information  Direetorate. 
Bioteehnology  holds  the  promise  of  addressing  problems  that  have  proven  diffieult  or 
even  possible  to  address  with  elassieal  eomputing.  “The  present  interest  in  bioeomputing 
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is  due  to  many  factors.  The  first  impetus  arises  from  electronics.  Limits  are  extended 
year  after  year,  yet  as  some  point  the  size,  speed,  and  power  dissipation  of  switches  based 
on  silicon  or  other  conventional  materials  will  run  into  the  deadlocks  set  by  the  basic 
laws  of  physics.  Already  the  ‘leakiness’  of  quantum  effects  between  nano-sized  wires 
that  are  nanometers  apart  has  sparked  interest  in  somehow  harnessing  the  power 
(quantum  cellular  automata,  etc.)  of  quantum  mechanical  devices.  The  second  impetus  is 
that  although  conventional  computer  science  has  been  extremely  successful,  a  number  of 
critical  problems  in  information  processing  have  persisted  stubbornly  beyond  reach: 
pattern  recognition,  learning,  and  parallelism  being  three  examples  of  where  biological 
systems  still  remain  far  advanced  beyond  their  silicon  mimics. ”[48]  Several  articles  have 
looked  at  the  physical  limits  of  current  computer  technology  [14,  50,  58]  and  debated 
whether  or  not  the  demise  of  Moore’s  Law  is  on  the  horizon  necessitating  the  need  for 
alternative  computing  technologies  [24,  28,  30,  34,  42,  46,  52,  55,  62]. 

There  are  many  different  theoretical  perspectives  on  how  one  might  leverage 
biotechnology  to  achieve  new  computing  capability  which  makes  it  difficult  to  locate  the 
best  path  to  a  practical,  realizable  concept.  Highlighting  the  diversity  is  the  number  of 
names  of  biocomputing  that  were  encountered  during  the  literature  review  which 
included:  DNA  computing,  molecular  computing,  biomolecular  computing,  bioinspired 
computing,  membrane  computing,  reaction-diffusion  computing,  and  chemical  based 
computing.  Periodically  papers  were  written  that  provided  a  snapshot  of  what  progress 
was  made,  the  direction  of  ongoing  research,  and  research  challenges.  [10,  15,  39,  40,  41] 
The  literature  review  reaffirmed,  that  it  is  not  a  question  if  biotechnology  can  be 
leveraged  and  developed  into  new  technologies,  but  when  biotechnology  will  be 
integrated  into  advanced  systems.  One  source  supporting  this  opinion  is  The 
International  Technology  Roadmap  for  Semiconductors  (ITRS)  which  includes 
information  regarding  molecular  devices,  molecular  memories  and  biologically  inspired 
architectures  as  experts  look  at  the  status  of  the  semiconductor  industry  and  its  future 
direction.  The  roadmap  perspective  is  that  of  emulating  human  and  biological  reasoning 
functions  through  biologically  inspired  computing  which  utilizes  the  technology  growth 
of  the  semiconductor  industry  and  the  existing  infrastructure.  Metrics  were  established 
on  which  to  measure  the  progress.  One  set  of  metrics  compiled  for  the  ITRS  are  the 
fundamental  parameters  of  the  human  brain  [22]  which  include: 

a.  Number  of  neurons  -  2E 10 

b.  A  single  neuron  can  make  100  to  10,000  synaptic  connections 

c.  Power  consumption  -  15-30  Watts 

d.  Information  Stored  -  IE  12  (short  term)  bits 

e.  Information  process  speed  -  IEI3  bits/second 

A  question  that  came  to  mind,  as  the  ITRS  information  was  reviewed,  is  if  we  restrict 
ourselves  to  developing  only  biotechnology  supported  by  the  infrastructure  of  the 
semiconductor  industry,  are  we  inhibiting  the  exploitation  of  the  full  power  of  biology  for 
computing?  At  this  time,  the  answer  is  yes  because  we  are  far  from  understanding  the 
true  potential  of  biotechnology.  Even  the  ITRS,  with  its  narrowed  focus,  acknowledges 
that  much  more  research  is  still  needed.  Garzon  and  Deaton  stated  that,  “It  is  increasingly 
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clear,  however,  that  understanding  the  aetual  power  of  biomoleeules  to  solve 
eomputational  problems  in  praetiee  requires  developing  the  notion  of  eomplexity  that 
eaptures  the  physieo-ehemieal  reality  in  whieh  they  take  plaee  (entirely  different  from 
VLSI-based  programs),  so  that  it  results  ean  be  used  to  gauge  the  true  eomputational 
power  of  moleeular  oomputmg.”[15]  They  also  diseussed  the  fundamental  ehallenges 
that  needed,  “to  be  resolved  for  bringing  moleeular  eomputing  to  an  effeetive  new 
paradigm  for  eomputation  soienoe.”[15]  The  ehallenges  are: 

a.  Reliability,  efficieney  and  sealability 

b.  The  eneoding  problem 

e.  Error-preventing  eodes 

d.  Building  and  programming  moleeular  eomputers 

e.  Implementing  evolutionary  eomputation 

f  Autonomy  and  self-assembly 

g.  Moleeular  eomputability  and  eomplexity 

A  list  of  resourees  for  interested  parties  to  obtain  information  on  the  progress  of  the 
researeh  is  provided  in  their  paper.  The  eoneepts  of  DNA  Computing, 
Immunoeomputing,  Membrane  Computing  and  Cellular  Computing  will  be  diseussed  in 
more  detail  below. 

DNA  eomputing  involves  the  utilization  of  a  fundamental  biologieal  building  bloek  to 
earry  out  eomputational  tasks.  DNA  is  an  example  of  an  information-earrying  polymer. 
“A  number  of  labs  are  taking  different  approaehes,  but  all  take  advantage  of  the 
eomplementarity  of  the  two  strands  of  double-stranded  DNA  and  the  ability  of  strings  of 
DNA  subunits  (Called  bases  or  nueleotides)  to  bind  together  using  Watson-Criek  pairing 
(A[adenosme]  with  T[thymidme]  and  C[eytosme]  with  G[guanidine]),  says  Seeman.”[27] 
Adleman  demonstrated  DNA  eomputing  in  1994  on  a  Hamiltonian  Path  Problem.[l] 

While  the  approaeh  he  used  is  not  praetieal  for  large  problems  his  experiment 
demonstrated  two  things.  First,  that  eomputing  ean  be  done  with  biologieal  material  in  a 
eontrolled  environment.  Seeond,  eomputing  ean  be  eondueted  with  very  small  struetures, 
struetures  mueh  smaller  than  the  eurrent  state-of-the-art  eomputing  teehnology.  His 
sueeess  also  raised  the  interest  of  other  researehers  sparking  renewed  interest  in 
leveraging  biology  in  order  to  obtain  new  eomputational  eapability.  MeCaskill,  van 
Noort,  and  others  eontinue  to  pursue  DNA  eomputing  developing  more  praetieal  means 
of  implementing  the  eomputing  oonoept.[32,  33,  56,  57]  Some  researeh  teams  looked  at 
the  DNA  strand  design  issue  for  elassieal  eomputation  and  self-assembly  eomputation  [6], 
others  examined  the  programming  environment  for  DNA  oomputing[7],  while  others 
addressed  the  role  surfaee  seienee  will  play  in  DNA  eomputing  beeoming  a  reality. [16] 
One  overview  of  this  area  of  researeh  stated:  “Whieh  tools  and  algorithms  will  turn  out 
best  for  DNA  eomputing  is  still  an  open  question.”  [29]  DNA,  however,  is  not  the  only 
aspeet  of  biology  researehers  are  pursuing. 

“The  natural  immune  system  is  a  subjeet  of  great  interest  beeause  it  provides  an  exeellent 
model  of  adaptive  proeesses  operating  at  a  loeal  level  and  of  useful  behavior  emerging  at 
the  global  level;  therefore,  it  inspires  new  powerful  paradigms  for  information  proeessing 
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and  computing. ”[63]  The  immediate  application  of  artificial  immune  systems,  a  bio¬ 
inspired  technology,  is  network  security  which  will  be  discussed  in  more  detail  later.  It 
also  seems  that,  after  a  brief  review  of  published  work,  the  functionality  of  the  immune 
system  could  play  a  role  in  the  development  of  eognizant  computing  or  in  the  processing 
of  massive  amounts  intelligenee  gathered  from  different  sourees.  Researchers  at  the  Jet 
Propulsion  Laboratory  looked  at  a  means  to  incorporate  the,  “...ability  to  take  the  position 
of  an  observer  in  relation  to  one’s  own  thoughts. ..”[63]  While  researehers  struggle  to 
deeipher  and  understand  the  natural  immune  system,  several  coneepts  repeatedly  surfaced 
sueh  as  the  immune  systems  ability  to  recognize  patterns,  deteet  abnormal  behavior,  and 
respond  to  ever  ehanging  threats.  Maturity  of  this  technology  area  continues  as 
researehers  gain  more  knowledge  just  like  other  bio-inspired  teehnologies  such  as 
membrane  computing. 

The  eoncept  of  membrane  eomputing  is  relatively  new,  coming  into  existence  in  1998 
when  Gheorghe  Paun  published  his  first  paper  on  the  subject.  He  stated  that,  “It  must  be 
stressed  that  membrane  systems  (also  called  P  systems)  are  not  intended  to  model  the 
funetioning  of  biologieal  membranes. ”[35]  “Membrane  Computing  is  a  bio-inspired 
branch  of  natural  computing,  abstracting  computing  models  from  the  structure  and 
funetioning  of  living  cells  and  from  the  organization  of  eells  in  tissues  or  other  high  order 
struetures.”  [37]  “In  short,  it  deals  with  distributed  and  parallel  computing  models, 
proeessing  multisets  of  symbol-objects  in  a  localized  manner  (evolution  rules  and 
evolving  objects  are  eneapsulated  into  eompartments  by  membranes),  with  an  essential 
role  played  by  the  eommunication  among  compartments  (with  the  environment  as 
well).  ..The  essential  ingredient  of  a  P  system  is  its  member  structure,  whieh  ean  be  a 
hierarehieal  arrangement  of  membranes,  like  a  cell  (hence  deseribed  by  a  tree),  or  a  net  of 
membranes  (plaeed  in  the  nodes  of  a  graph),  like  in  a  tissue,  or  in  a  neural  net.”[36] 

Figure  1  illustrates  the  concept  of  a  elassical  P  system  that  Paun  has  used  in  several 
papers.  Figure  2  shows  the  tree  diagram  of  the  same  example. 


Figure  1.  Illustration  of  a  membrane  structure  of  a  P  system  with  fundamental 

eomponents  highlighted. 
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Figure  2.  Tree  diagram  for  example  membrane  structure  sample  illustrated  in  Figure  1. 

Several  variations  of  P  systems  exist.  A  good  web  site  for  additional  information  is: 
http://www.psvstems.disco.unimb.it.  Features  of  P  systems  that  make  it  attractive  for 
different  applications  are:  distribution,  discrete  mathematics,  algorithmicity, 
scalability/extensibility,  transparency,  parallelism,  non-determinism  and 
communication.[36]  Application  areas  include:  systems  biology,  economics,  computer 
graphics,  linguistics,  and  management.  The  concept  of  actually  computing  with  cells  is  a 
different  category  of  computing. 

Cellular  computing  is,  “A  computing  philosophy  based  on  simplicity,  vast  parallelism 
and  locality.”[47]  Several  different  definitions  of  cellular  computing  were  found  during 
the  literature  review.  Some  researchers  define  it  as  computing  with  living  cells  while 
others  refer  to  it  as  computing  with  engineered  cells  or  synthetic  cells.  “Sipper  defines 
three  principles  of  cellular  computing: 

1 .  Simplicity:  The  processor  in  cellular  computing  is  defined  as  a  cell.  It  can  do 
very  little  on  its  own.  The  system  performs  complex  tasks  through  the  combined 
function  and  cooperation  of  many  cells. 

2.  Vast  Parallelism:  Most  parallel  computers  have  at  most  40-60  processors. 
‘Massively  parallel’  computers,  as  they  are  called,  contain  thousands  or  tens  of 
thousands  of  processors.  In  this  system,  the  parallelism  is  exponential,  with  lOX 
processors. 

3.  Locality:  Each  cell  communicates  only  with  other  cells  that  are  close  by,  and 
each  communication  contains  only  a  small  amount  of  information.  There  is  no 
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single  cell  that  has  an  overview  of  the  computer  system,  and  no  single  cell  that 
controls  the  entire  system  i.e.  there  is  no  CPU  or  central  processing  unit. 

Systems  which  use  bacteria  as  substrates  for  engineering  meet  these  three  criteria  for  a 
cellular  computing  system.”[47]  The  key  to  this  computing  concept  is  controlling  the 
behavior  of  cells.  It  was  recognized  that  the  processing  speed  of  hardware  will  be  slow, 
but  it  may  be  possible  to  compensate  for  this  deficiency  with  the  appropriate  software. 
Research  in  Amorphus  Computing  has  been  identified  as  providing  key  insight  into 
programming  approaches  appropriate  for  cellular  computing.  [2,  47] 

While  numerous  researchers  looked  at  portions  of  the  technology  needed  to  realize 
cellular  computing  some  of  the  more  significant  research  leading  to  practical 
implementation  of  cellular  computing  is  at  the  Massachusetts  Institute  of  Technology  and 
Princeton.  This  research  provides  a  basis  for  IF/IFTA  to  pursue  collaboration  with 
researchers  in  the  Human  Effectiveness  Directorate.  Weiss  et  al.  published  progress 
made  in  developing  technology  for  cellular  computing,  or  more  precisely,  “creating 
synthetic  gene  networks  for  modifying  and  extending  the  behavior  of  living  organisms.” 
[60].  “The  main  challenge  in  biocircuit  design  lies  in  selecting  well  matched  genetic 
components  that  when  coupled,  reliably  produce  the  desired  behavior.”  [60]  Research 
Weiss  was  involved  with  developed  a  design  process  to  allow  modular,  “construction  of 
complex  circuits  using  a  library  of  interchangeable  components.”  [61]  It  was  stated  that, 
“While  the  field  holds  great  promise,  we  still  face  a  number  of  challenges.  One  of  the 
major  obstacles  is  our  current  inability  to  devise  models  and  perform  simulations  that  can 
accurately  predict  the  quantitative  behavior  of  genetic  networks.  Thus,  the  difficulty  of 
inferring  function  from  DNA  sequence  limits  our  circuit  design  efficiency.  In  addition, 
we  must  delineate  the  limits  of  the  new  information  processing  capabilities  that  can  be 
embedded  into  cells.  Beyond  the  engineering  of  individual  cells,  new  programming 
paradigms  are  needed  to  achieve  the  coordinated  behavior  of  cell  aggregates.  Biological 
substrates  are  constrained  by  factors  such  as  unreliable  computing  elements,  significant 
noise,  and  imperfect  communication  with  limited  range.  Therefore,  our  designs  must 
strive  to  achieve  sufficient  reliability  and  reproducibility.  Such  a  myriad  of  challenges 
leave  much  opportunity  for  future  work.”  [60]  In  another  article  it  was  pointed  out  that, 
“We  do  not  have  a  complete  list  of  all  repressors  and  the  sites  to  which  they  bind.  We  do 
not  have  accurate  or  complete  data  on  the  kinetic  constants  involved.  We  still  do  not 
understand  the  metabolism  or  reproduction  of  cells  well  enough  to  accurately  predict  the 
effects  of  any  interference  with  them.”  [47] 

As  the  cellular  computing  concept  and  technology  mature  they,  “...will  enable  cells  to 
perform  sophisticated  digital  and  analog  computation,  both  as  individual  entities  and  as 
part  of  larger  cell  communities.  This  engineering  discipline  and  its  associated  tools  will 
advance  the  capabilities  of  genetic  engineering,  and  allow  us  to  harness  cells  for  a  myriad 
of  applications  not  previously  achievable. ”[60]  Other  application  areas  include:  human 
therapeutics,  synthesis  of  pharmaceutical  products,  molecular  fabrication  of  biomaterials, 
nanomachine  assembly,  and  toxin  detection  with  biological  sentinels. 
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2,2  Memory 


Biotechnology  may  also  provide  a  means  to  store  and  retrieve  information  more 
efficiently.  The  application  of  biotechnology  to  advanced  memory  technologies 
represents  the  most  likely  path  to  the  first  practical  realization  of  the  technology  in  an 
information  system.  There  are  many  different  proposals  for  molecular  memory 
recognized  in  the  fTRS.  One  briefly  explained  example  involved  the  storage  of  data  that 
functioned,  “by  applying  an  external  voltage  that  causes  the  transition  of  the  molecule 
into  one  of  two  possible  conduction  states.  Data  is  reading  by  measuring  resistance 
changes  in  the  molecular  cell.”  They  went  on  to  state  that,  “There  are  also  concepts  for 
combining  molecular  components  with  current  memory  technology,  such  as  DRAM  and 
floating  gate  memory. ”[22]  fn  the  research  legacy  associated  with  the  information 
Directorate,  there  was  an  investment  in  three  dimensional  protein-based  optical  memory 
utilizing  bacteriorhodopsin  and  a  sequential  one  photon  process.  Spanning  several  efforts, 
the  process  was  refined,  material  optimized,  and  prototypes  built  striving  to  provide  the 
foundation  for  a  commercially  viable  optical  memory.  [3,4,5]  Two  other  approaches  to 
protein-based  volumetric  memories  are  holographic  and  simultaneous  two-photon.  [31] 
The  Australian  Broadcasting  Company  recently  reported  that  a  prototype  device  was 
produced  setting  the  stage  for  a  bacteriorhodopsin  coated  DVD  projected  to  hold  50 
terabytes  of  information,  ft  is  estimated  that  a  USB  disk  will  be  commercialized  by  July 
2007  and  a  DVD  version  by  July  2008.  [44] 

2,3  Network  Management  and  Security 

Network  management  and  security  represents  an  area  that  will  benefit  from  bio-inspired 
technology.  Researchers  are  exploring  how  the  natural  immune  system  responds  to 
abnormal  events  or  foreign  invaders  and  how  this  knowledge  can  be  applied  to  computers 
and  computer  networks.  At  the  time  of  this  literature  review  the  impression  was  that,  for 
this  application  of  biotechnology,  research  is  focused  on  replicating  particular  end  states 
in  phases  of  immune  system  functionality,  not  the  biological  processes  that  led  to  the  end 
state.  This  is  primarily  due  to  the  fact  that,  “...the  biological  machinery  of  the  immune 
systems  is  poorly  understood,  the  only  alternative  is  to  mimic  the  phenomenology  of  its 
performance  using  some  equivalent  physical  models.”  [62]  Using  the  natural  immune 
system  as  a  guide,  the  computer  security  research  teams  are  trying  to  develop  analogous 
techniques  for  detecting  abnormal  behavior  on  computers  or  computer  networks,  and 
determining  the  proper  response.  This  research  can  be  referred  to  as  artificial  immune 
systems,  immunocomputing,  or  immunity-based  techniques,  fnterest  in  this  bio-inspired 
technology  for  computer  technology  has  grown  since  1994.  [9]  University  of  Memphis 
researchers,  for  instance,  are  interested  in  developing  a  technique  which  could 
differentiate,  “various  degrees  of  abnormality  in  network  traffic.  [9]  Researchers  at  the 
Air  Force  fnstitute  of  Technology  examined,  “the  use  of  a  distributed-agent  biological- 
system  approach  toward  the  computer-security  problems  of  virus  elimination  and 
JD.”[17]  Florida  fnstitute  of  Technology  research  has  examined  the  use  of  an  artificial 
immune  system  for  securing  ad  hoc  networks  against  intrusion  attacks.  [19]  ft  was  stated 
that,  “As  with  biological  immune  systems,  the  problem  of  protecting  computer  systems 
from  malicious  intrusions  can  be  viewed  as  the  problem  of  distinguishing  ‘self  from 
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dangerous  ‘other’  (or  ‘non-self)  and  eliminating  this  ‘other. ”’[51]  As  the  dependency  on 
computers  grows  and  knowledge  of  natural  immune  systems  increases,  this  relatively 
new  area  of  bio-inspired  technology  is  likely  to  play  a  bigger  role. 

2.4  Nanofabrication/Self-assembly/Reconfigurability 

Future  nanoelectronic  hardware  will  benefit  from  biotechnology  used  to  fabricate  new 
devices  and  architectures  when  current  fabrication  techniques  reach  their  scaling  limits. 
Traditional  lithographic  based  fabrication  approaches  will  be  replaced  by  processes  that 
leverage  the  natural  ability  of  biological  organisms  to  bind  to  a  receptor  protein,  assemble 
multiprotein  complexes,  etc.  Molecular  recognition  plays  a  major  role  in  self-assembly 
and  self-recognition.  “Molecular  recognition  is  the  binding  and  specific  selection  of 
substrate(s)  by  a  given  receptor  molecule. ...Self-assembly  refers  to  the  more  or  less 
automatic  putting  together  of  parts,  based  almost  purely  on  molecular  recognition.  Self¬ 
organization  refers  to  self-assembly  with  the  generation  of  organized  entities  and  may 
result  in  networks  and  feedback  systems  on  different  levels. ”[49]  “Molecular  synthesis 
describes  the  fabrication  of  molecules  and  macromolecules  for  application  to  molecular 
devices  ...,  thin  films,  directed,  and  self-assembly  agents.  Improved  understanding  of  the 
relationship  between  molecular  structure  and  electronic  properties  may  enable  design  and 
application  of  new  molecules  with  unique  properties  as  new  elements  of  emerging 
research  devices.  Furthermore,  needs  for  sub-nanometer  directed  and  self-assembly  may 
require  molecules  designed  to  recognize  other  molecules  or  structures  as  registration 
features.  Thus,  research  is  needed  to  understand  the  relationship  between  molecular 
structure  and  potential  assembly  mechanisms  and  their  limitations  in  directed 
assembly. ”[22]  Researchers  at  Duke  University,  for  instance,  are  not  only  conducting 
research  on  how  to  fabricate  DNA  self-assembled  computer  architectures,  but  are  also 
developing  tools  and  techniques  to  make  the  technology  commercially  viable.  They 
looked  at  carbon  nanotube  transistors  self-assembled  into  circuits  using  DNA  [11]  along 
with  metallized  DNA  to  form  decoupled  array  multi-processors  and  oracles. [12]  They 
also  developed  a  system  level  design  approach  to  evaluate  self-assembled  computer 
architectures. [13]  Several  agencies  are  funding  research  and  IF  has  executed  contractual 
efforts  as  it  pursues  the  integration  of  biotechnology  into  future  information  systems. 

2.5  Bioinformatics 

Bioinformatics,  as  it  pertains  to  the  Information  Directorate,  can  be  looked  at  from  a 
couple  of  different  viewpoints.  First,  one  can  seek  to  exploit  biological  databases  and 
other  experimental  results  to  leverage  other  researchers’  progress  to  facilitate  the 
development  of  biotechnology  for  new  applications.  This  is  the  viewpoint  most  likely 
supported  by  researchers  interested  in  developing  new  computing  capabilities.  Second,  it 
is  an  application  area  that  can  leverage  AFRL/IF’s  experience  with  databases,  managing 
information,  and  fusing  different  sources  of  information  for  sound  decision  making. 
Experimentally,  biologists  and  other  researchers  can  gather  data,  but  the  problem  is  in 
understanding  the  information.  There  is  a  wide,  “...range  of  applications  of  bioinformatics 
to  molecular  biology,  clinical  medicine,  pharmacology,  biotechnology,  agriculture, 
forensic  science,  anthropology  and  other  disciplines.”  [26]  Numerous  computational 
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biology  tools  were  developed  utilizing  a  variety  of  bioinformatie  approaehes  ineluding 
statistical,  modeling  artificial  neural  networks,  hidden  Markov  models,  data  mining,  etc. 
Biologists  and  other  scientists  in  various  government  organizations  and  other  medical 
research  communities  have  expressed  their  desire  for  AFRL/IF  to  invest  in  the  second 
viewpoint.  It  has  been  stated  that,  “...many  of  the  problems  faced  in  bioinformatics  are 
daunting  in  their  size  and  scope. ”[8]  Limited  resources,  however,  constrains  AFRL/IF’s 
computer  science  expertise  to  focus  primarily  on  command  and  control  needs  for  the 
warfighter.  The  National  Science  Foundation  provides  funds  for  research  and  holds 
workshops  to  bring  together  bioinformatie  researchers,  users  of  molecular  biology 
databases,  and  computer  scientists  to  learn  about  what  has  been  done  and  where  to  direct 
future  funds. [23]  Through  programs  such  as  BioCOMP,  DARPA  (with  support  from 
Information  Directorate  personnel)  seeks  to  improve  the  role  computation  plays  in 
biology. 


3,0  Computational  Biology  Tools 

Numerous  independent  computational  tools  are  used  by  researchers  to  obtain  a 
quantitative  understanding  of  biology.  The  BioCOMP  Program  sought  to  address  the 
non-integrated  issue  by  developing  an  open-source  environment  in  which  researchers 
could  bring  together  different  software  tools,  exchange  information  by  means  of  standard 
data  formats,  and  access  biological  databases.  Information  Directorate  involvement  in 
the  BioCOMP  Program  exposed  the  team  to  a  community  of  researchers  and  their  tools 
about  which  Information  Directorate  personnel  previously  had  limited  knowledge  and  no 
hands  on  experience.  Directorate  personnel  do,  however,  have  experience  with 
computational  tools  from  other  disciplines  including  VLSI  and  microsystem  design  tools. 
The  primary  reason  for  supporting  the  BioCOMP  Program  was  that  part  of  the  program 
was  originally  directed  at  understanding  the  role  biology  could  play  in  computing.  In 
order  for  technology  to  be  realized,  it  will  need  to  be  supported  with  the  appropriate 
modeling  and  simulation  tools  and  techniques.  Past  experience  indicates  that  a  process 
for  multiscale,  multiphysics  modeling  and  simulation  is  needed  to  bring  new  science  to 
build  future  hybrid  information  systems.  Figure  3  illustrates  the  different  levels  of 
multiscale,  multiphysics  modeling  and  simulation.  These  levels  of  simulation  are  based 
upon  in-house  microsystem  design  experience  and  are  applicable  to  either  a  top-down  or 
bottom  up  design  approach.  It  was  observed  that  designers  with  electronics  engineering 
undergraduate  degree  are  more  likely  to  use  a  top  down  approach  while  those  with  a 
mechanical  or  aerospace  engineering  background  were  more  comfortable  with  the  bottom 
up  approach.  The  three-dimensional  (3-D)  physical  level  represents  the  detailed  analysis 
of  the  physical  device  likely  involving  finite  element,  finite  difference,  finite  volume,  or 
boundary  element  methods.  The  reduced  order  modeling  level,  commonly  referred  to  as 
macromodeling  in  electronic  design  community,  involves  modeling  of  the  control 
circuitry  of  the  device  with  the  performance  characteristics  of  the  device  obtained  in  the 
3-D  physical  modeling  level  and  captured  by  a  mathematical  equation.  The  reduced 
order  model  represents  a  subsystem  of  the  whole  system  in  which  the  technology  is  to  be 
integrated  involving  VHDL-AMS  or  SPICE  modeling  formats.  The  system  level  portrays 
the  whole  system  bringing  together  all  of  the  subassemblies  utilizing  VHDL,  VHDL- 
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AMS,  or  VERILOG  A  formats.  The  biological  level  represents  all  of  the  new  sciences 
that  are  being  examined  for  exploitation  in  alternative  computing  concepts  that  will  be 
used  in  hybrid  computing  architectures  and  not  currently  supported  in  common  design 
tools  and  processes.  The  open  question  is  how  does  one  bring  new  science  into  the 
process?  In  the  case  of  biotechnology  this  question  leads  to  several  more  questions: 

a.  If  one  models  a  biological  process  at  the  molecular  or  cellular  levels  what 
information  needs  to  be  extracted  from  the  simulation  results  in  order  to  develop  a 
practical  computing  concept? 

b.  What  information  must  be  passed  between  the  different  levels  of  modeling  and 
simulation? 

c.  How  is  information  passed  between  the  different  levels  of  modeling 
simulation? 

d.  Is  the  process  a  true  bottom  up  design  process,  or  can  a  level  be  skipped  and 
the  appropriate  reduced  order  models  be  established  without  intensive  three 
dimensional  physical  modeling? 

e.  Can  computational  biology  tools  be  used  to  develop  biotechnology  for 
computing? 

f  Can  computational  biology  tools  be  integrated  into  a  process  similar  to  that 

used  by  microelectromechanical  system  designers? 

g.  Should  computational  biology  tools  be  integrated  into  such  a  process? 

These  questions  are  difficult  to  address  and  it  was  never  the  intention  of  this  effort  to 
address  them  all,  but  to  start  building  the  foundation  of  knowledge  that  could  begin  the 
process.  One  of  the  first  steps  in  building  this  foundation  was  to  examine  different 
computational  biology  tools  and  understand  the  type  of  problems  they  address. 
Participating  in  the  BioCOMP  Program  provided  an  excellent  opportunity  to  evaluate  a 
variety  of  tools  as  it  sought  to  produce  an  open  source  software  environment  for 
computational  biology  tools. 

The  portion  of  the  BioCOMP  Program  that  focused  on  improving  the  role  computation 
plays  in  biology  centered  on  the  development  of  the  Biological  Simulation  Program  for 
Intra-Cellular  Evaluation  (Bio-SPICE).  Bio-SPICE  is  intended  to,  “...assist  in  the 
clarification  of  the  complexity  inherent  in  the  biochemical  networks  that  operate  within 
living  cells.  Intracellular  biochemical  networks  work  in  a  highly  orchestrated  fashion  to 
regulate  and  control  cell  behavior,  and  often  employ  complex  positive  and  negative 
feedback  loops  that  can  introduce  strong  nonlinearities  into  the  system.  The  dynamics  of 
such  systems,  even  in  relatively  simplified  mathematical  representations,  can  easily 
surpass  the  reasoning  capabilities  of  the  unaided  human  mind.  Helping  the  life  sciences 
community  begin  to  unravel  this  complexity,  BioSPICE  aims  to  provide  a  suite  of 
software  tools  and  reusable  model  elements  that  augment  the  capabilities  of  both  ‘belly  to 
the  bench’  and  ‘theoretical’  biologists  with  powerful  mathematical  and  automated 
reasoning  approaches.  More  specifically,  the  system  has  been  designed  to  accommodate 
both  naive  and  the  power  users.”  [25]  As  the  program  evolved,  the  various  teams  changed 
along  with  the  Bio-SPICE  software  environment  and  the  computational  tools  that  worked 
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Figure  3.  Illustration  of  the  various  levels  of  modeling  and  simulation  required  for 

developing  hybrid  architecture. 


in  it.  Appendix  A  provides  a  snapshot  of  different  tools  available  in  the  BioCOMP 
Program.  The  information  in  the  table  was  extracted  from  a  list  at  the  program’s  website 
(www.biospice.org)  established  and  maintained  by  SRI  International,  Menlo  Park,  CA, 
lead  system  integrator  for  Bio-SPICE,  during  the  BioCOMP  Program.  The  information 
has  now  moved  to  SourceForge.net  because  of  completion  of  the  DARPA  program.  In 
2003,  two  special  issues  of  OMICS,  Volume  7,  Numbers  3  and  4,  were  dedicated  to  Bio- 
SPICE  with  several  articles  highlighting  contributions  and  technical  progress  made  by 
various  teams  in  the  Bio-COMP  Program.  University  of  Texas  research  examined  the 
usability  of  Bio-SPICE  and  several  tools  that  work  in  the  Bio-SPICE  environment  using 
their  experience  to  create  a  tutorial  manual  for  first-time  users.  Towards  the  end  of  the 
program  CFD  Research  Corporation  was  funded  to  develop  cellular  modeling  capability 
using  partial  differential  equations.  Researchers  from  Vanderbilt  University  were  also 
brought  in  to  examine  how  their  expertise  with  model-based  design  and  integration 
techniques  could  help  facilitate  the  maturation  of  the  tool  integration  framework. 
Naturally,  the  ease  with  which  the  tools  worked  in  Bio-SPICE  varied.  Factors  that 
influenced  the  performance  of  the  tools  were:  funding  levels  of  the  different  projects. 
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reliance  on  legacy  programming,  ability  to  adapt  to  change  as  the  Bio-SPICE  software 
environment  evolved,  computer  skills  of  the  research  team,  and  experience  of  the  team 
and  the  principal  investigators. 


4,0  Utilization  of  Computational  Biology  Tools 

Prior  to  this  effort  there  was  limited  hands-on  experience  within  the  Information 
Directorate  with  computational  biology  tools.  The  objective  of  this  portion  of  the  project 
was  to  examine  the  usability  of  the  Bio-SPICE  software  environment  and  demonstrate 
how  it  could  be  used  to  develop  biotechnology  for  computing.  During  this  effort  there 
were  several  releases  of  Bio-SPICE  as  it  continued  to  evolve  with  a  release  happening 
every  six  months.  Two  college  students  employed  for  the  summer  by  the  Information 
Directorate  at  the  Rome  Research  Site  worked  with  Bio-SPICE.  One  of  their  assigned 
tasks  required  them  to  examine  the  usability  of  the  current  version  of  software  during  the 
time  they  were  employed.  The  usability  evaluation  examined  how  easily  the  software 
could  be  installed,  set  up  and  used  to  run  sample  problems  that  tool  developers  provided 
with  the  software.  Essentially  the  students,  who  were  not  computer  science  majors,  but 
had  biology  backgrounds,  were  asked  to  use  the  documentation  that  was  provided  with 
the  software  to  get  it  up  and  running.  Since  the  students  were  each  employed  during  two 
consecutive  summers  they  naturally  worked  with  different  versions  of  Bio-SPICE.  They 
worked  with  as  many  modules  as  possible  until  approximately  six  weeks  were  left  in  their 
summer  term  of  employment.  During  the  last  six  weeks  the  focus  was  shifted  to 
demonstrating  how  Bio-SPICE  could  be  applied  to  examining  the  role  biology  could  play 
in  computing.  A  summary  of  what  the  in-house  team  found  while  working  with  the  Bio- 
SPICE  will  be  presented  starting  with  Bio-SPICE  4.0,  then  Bio-SPICE  6.0,  and  wrapping 
up  with  comments  about  initial  work  with  Bio-SPICE  7.0. 

Bio-SPICE  4,0 

Bio-SPICE  4.0  represented  a  significant  improvement  in  the  Dashboard,  the  graphical 
user  interface  for  Bio-SPICE,  and  the  potential  for  linking  different  tools  together 
graphically  in  a  workflow.  Eigure  4  provides  a  snapshot  of  the  Dashboard  with  a 
workflow  on  it.  A  workflow  is  the  arrangement  of  some  combination  of  input  files, 
modelers,  simulators,  analyzers  and  output  processes  brought  together  by  the  user  on  the 
Dashboard.  Improvement  in  documentation  for  the  software  was  seen,  but  more 
refinement  was  still  needed  especially  if  someone  with  minimal  computer  skills  was  to 
try  to  use  the  tools.  Ideally,  a  person  with  minimal  computer  skills  would  like  to  install 
the  software  and  start  using  it  right  away,  but  with  this  version  of  the  software  it  was 
found  that  more  than  minimal  skills  were  needed  to  try  to  get  the  different  software 
modules  running.  Numerous  times  pathnames  had  to  be  modified,  installation  scripts 
examined,  and  developers  contacted  to  work  the  errors  and  bugs  that  were  encountered. 
One  demonstration  involved  linking  the  Datawarehouse,  NCA  Analyzer,  Geneways, 
Graphviz,  and  Pathway  Builder  modules,  hollowing  the  demonstration  steps  revealed 
one  problem  with  documentation  for  evolving  software.  A  module  was  created  to  link 
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Matlab  with  Bio-SPICE,  that  eliminated  the  requirement  for  using  the  NCA  Analyzer,  but 
the  demonstration  doeumentation  was  not  updated  to  reflect  this 


Figure  4.  Snapshot  of  a  Dashboard  workflow. 

change.  All  the  modules  were  eventually  installed,  but  another  error  was  encountered 
when  software  available  in  Linux  was  not  available  in  Windows.  While  a  work  around 
was  available,  it  was  determined  that  a  significant  amount  of  effort  would  be  expended, 
preventing  any  examination  of  other  tools.  The  decision  was  made  to  set  aside  this 
particular  demonstration  and  look  at  other  tools. 

The  next  group  of  evaluated  modules  were:  the  Metacluster  Toolbox,  CloneUpdater,  and 
the  three  PAINT  modules:  FeasnetBuilder,  Feasnet Adaptor,  and  FeasnetViewer.  The 
installation  and  building  of  the  demo  workflow  are  all  outlined  explicitly  in  a  file.  The 
installation  of  the  software  modules  was  pretty  straightforward.  FeasnetBuilder  required 
a  username  and  password  to  access  the  appropriate  database.  FeasnetViewer  problems 
were  encountered  and  after  a  few  attempts  to  fix  it  the  decision  was  made  to  move  on  to 
evaluating  a  different  set  of  modules.  The  capability  of  this  particular  tool  set,  at  the  time 
of  this  evaluation,  was  found  to  be  inadequate  to  demonstrate  how  Bio-SPICE  could  be 
used  to  develop  biotechnology  for  computing. 


14 


During  this  time,  a  problem  with  the  Bio-SPICE  Dashboard  oeeurred  when  trying  to  load 
saved  workflows.  The  browsing  window  did  not  reeognize  workflow  flies  as  .wf 
extensions.  The  user  had  to  change  the  dropdown  menu  to  ‘all  files’  and  find  the 
appropriate  workflow.  This  feed  back  was  provided  to  the  Bio-SPICE  developers. 

BioCharon  and  BioCharon  SketchPad  were  also  installed  and  evaluated.  SketchPad 
provides  the  user  with  the  capability  to  draw  out  models  of  biological  reactions.  The 
graphical  model  is  then  analyzed  using  BioCharon,  which  converts  it  into  the  Charon 
language.  These  modules  leverage  University  of  Pennsylvania’s  previous  work  with 
Charon  to  now  model  and  analyze  hierarchical  hybrid  biochemical  systems  symbolically 
by  state-space  exploration  techniques.  The  installation  was  just  a  matter  of  unpacking  a 
zip  file  and  executing  the  .jar  file.  To  get  the  two  programs  to  interact  properly  on  the 
other  hand  was  very  unclear,  requiring  e-mail  correspondence  with  a  representative  of  the 
tools’  developers  to  learn  how  to  create  a  classpath  from  the  command  prompt  to  get  the 
two  programs  to  execute  together  and  properly.  The  functionality  of  the  program  was 
checked  by  modeling  a  simple  reaction,  the  electron  transport  chain  which  is  the  basis  of 
aerobic  respiration.  [43] 

The  time  had  now  arrived  to  shift  the  focus  of  the  project  to  use  the  computational  tools 
installed  to  implement  a  biocomputing  technique  or  idea.  G-protein  biological  switches, 
which  are  found  throughout  the  body  and  used  in  adrenaline  responses,  were  selected  as 
the  biological  basis  for  the  demonstration.  It  was  envisioned  that  these  switches  could 
replace  the  logic  gates  that  are  the  basis  of  silicon  computers.  A  scheme  for  a  binary 
adder  was  prepared  using  these  G-protein  switches  instead  of  logic  gates.  The  reactions 
that  defined  this  biological  switch  were  modeled  using  SketchPad  and  references  to 
biology  that  were  found  on  the  internet.  [20,21]  Eigure  5  shows  the  schematic  of  an 
electrical  full  adder  circuit.  Eigure  6  illustrates  a  biological  representation  of  the  same 
circuit.  Eigure  7a  shows  a  closer  look  at  a  protein  switch  used  to  create  the  Bioadder  and 
its  representation  in  Eigure  7b. 


Eigure  5.  Illustration  of  a  full  adder. 
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Figure  6.  Illustration  of  a  Bioadder. 
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Figure  7.  Shows  the  coneept  of  the  protein  switeh  (a)  and  how  it  is  represented  in 

Sketchpad  (b). 


17 


Bio-SPICE  6.0 


Since  Bio-SPICE  continued  to  evolve  during  the  BioCOMP  Program,  another  look  at 
using  Bio-SPICE  and  expanding  upon  the  demonstration  from  the  previous  summer  was 
warranted.  A  noticeable  change  to  the  software  from  past  versions  was  the  ability  of 
clicking  on  a  link  for  the  update  center  which  automatically  installs  the  updates  to  various 
software  modules.  In  past  versions  the  user  had  to  manually  uninstall  the  software  in 
order  to  load  the  latest  versions  including  Bio-SPICE  and  its  Dashboard.  At  times  this 
seemed  to  be  problematic  since  it  was  hard  to  determine  if  all  of  the  components  of  the 
software,  such  as  portions  of  the  NetBeans  development  environment,  were  uninstalled 
completely.  The  process  for  doing  manual  updates  was  also  improved  because  issues 
with  legacy  software  made  it  difficult  for  some  teams  to  make  use  of  the  automatic 
update.  Better  guidance  was  also  provided  to  tool  developers  so  that  they  could  integrate 
their  tools  into  Bio-SPICE.  These  improvements,  however,  didn’t  mean  all  tools  were 
running  better  in  Bio-SPICE.  We  found  that  tools  that  ran  in  the  previous  version  did  not 
run  in  the  current  version  of  the  software.  BioSketchPad,  BioCharon,  Bio  Spreadsheet, 
Exact  Stochastic  Simulator  would  run  in  stand  alone  configurations,  but  not  in  Bio- 
SPICE  6.0.  The  teams  were  contacted  via  e-mail  and  various  options  of  installing  the 
software  and  paths  to  software  were  examined,  but  the  problem  still  existed.  The 
problem  was  documented  for  the  Bio-SPICE  support  team  to  address.  Subtle  variations 
in  hardware  and  software  configurations  on  individual  platforms  were  blamed,  but  a 
definite  answer  was  never  pinpointed.  Interestingly  in  this  Bio-SPICE  evaluation, 
JDesigner  and  Jarnac  software,  originally  written  for  the  Software  Biology  Workbench, 
not  only  ran  in  stand  alone  configuration,  but  also  in  the  Bio-SPICE  environment. 

At  the  time  this  work  was  performed,  JDesigner  was  found  to  be  the  most  useful  and 
robust  model  and  simulation  program  for  the  development  of  the  reaction  pathways  being 
examined.  The  tool  allowed  for  a  neat  and  accurate  representation  of  the  specific  signal 
transduction  pathway  or  gene  regulation  network.  JDesigner  contained  several  rate  laws 
that  could  be  integrated  into  the  pathway  or  network  and  provides  an  option  to  customize 
a  particular  rate  law  in  order  to  achieve  a  desired  output.  JDesigner  provides  three 
different  modes  of  simulation  and  analysis:  time  course  simulation,  steady  state  analysis, 
and  stoichiometry  analysis.  These  three  modes  of  simulation  and  analysis  can  produce 
either  tabular  or  graphical  output.  When  integrated  with  BioSPICE  Dashboard  v6.0,  the 
analytical  capability  of  the  software  was  increased  since  several  other  operators  can  be 
applied  to  the  given  model  and  additional  outputs  obtained.  JDesigner  can  convert  the 
model  designed  in  its  environment  into  systems  biology  markup  language  (SBME)  code. 
The  versatility  and  user-friendliness  of  the  JDesigner  program  led  to  its  selection  for  the 
modeling  of  reaction  pathways  in  this  project. 

After  exploring  the  usability  of  Bio-SPICE,  the  focus  shifted  to  expanding  upon  the 
demonstration  performed  the  previous  summer.  The  work  explored  a  development  path 
for  a  potential  biomolecular  computing  concept  which  exploits  proteins,  gene  network 
regulation  and  signal  transduction  pathways  to  execute  computational  functions. 
JDesigner  provides  an  appropriate  environment  for  the  analysis  of  a  known  signal 
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transduction  pathway  or  gene  regulation  network,  and  the  design  and  development  of  an 
arrangement  of  eomputational  eomponents  using  a  speeifie  reaetion  pathway.  A 
guanosine  triphosphate  (GTP)  signaling  pathway  was  eonstrueted  allowing  the 
manipulation  of  different  inputs  in  order  to  observe  possible  effeets  on  the  output  of  the 
system.  An  arrangement  of  eomputational  eomponents  was  also  ereated  eonsisting  of  a 
speeifie  number  of  easeade  eyeles  that  funetioned  as  NAND  and  NOT  logie  gates. [54] 
The  ability  of  JDesigner  to  reproduee  output  that  was  obtained  from  experimental  studies, 
supports  its  use  as  a  tool  for  analysis  and  modifieation  of  known  signal  transduetion 
pathways  and  gene  regulation  networks.  The  artiele  also  provided  kinetie  eonstants 
deseribing  the  rate  laws  for  the  stimulation  and  inhibition  of  adenylate  eyelase  in  the 
presenee  of  sodium  ehloride  whieh  was  reprodueed  during  the  software  evaluation. 

The  JDesigner  program  also  provided  an  environment  whieh  faeilitated  the  design  of  an 
arrangement  of  eomputational  eomponents  using  a  reaetion  pathway  to  represent  logie 
gates.  A  logic  gate  proeesses  a  signal  that  represents  true  of  false.  [18]  The  logie  gate 
requires  that  the  signal  transduetion  pathway  has  a  fast  ehange  in  eoneentration  of  a 
partieular  protein  or  moleeule,  whieh  is  eontrolled  by  a  threshold  level  on  an  input.  The 
switeh  from  one  eoneentration  level  to  another  in  a  pathway  is  known  as  ultrasensitivity 
and  it  allows  the  implementation  of  Boolean  funetions  to  the  reaetion  system  [45].  The 
reaetion  pathway  that  was  used  to  build  a  logie  gate  was  the  eyclie  reaetion  between  a 
kinase  protein  and  a  phosphatase  protein,  whieh  is  part  of  a  signal  transduetion  pathway. 
The  ultrasensitivity  property  of  this  reaetion  pathway  enables  it  to  be  manipulated  to 
produee  a  logie  gate.  The  steady-state  analysis  of  the  eyelie  reaetion  yields  a  sigmoid 
eurve  and  by  deereasing  the  Km  values  of  the  enzymes,  the  reaetion  will  inerease  the 
degree  of  sigmoidieity  of  the  eurve,  resembling  a  switehing  aetion  [45].  Using  a  easeade 
eyele  a  NOT  gate  and  a  NAND  gate  were  ereated  in  JDesigner.  Figure  8  illustrates  a 
standard  eleetronie  NAND  gate  while  Figure  9  shows  the  biologieal  representation  of  the 
gate.  The  differenee  between  the  NOT  gate  and  the  NAND  gate  was  that  the  eyelie 
reaetion  representing  the  NOT  gate  required  one  input,  while  the  eyelie  reaetion 
representing  the  NAND  gate  required  two  inputs.  These  two  gates  were  analyzed  using 
the  time  eourse  simulation  to  see  what  outputs  would  result  from  different  inputs.  The 
Vmax  value  and  Km  value  of  the  Miehaelis-Menten  rate  laws  were  manipulated  to  obtain 
the  desired  outputs  for  the  NOT  and  NAND  gate  operators  eorresponding  to  speeifie 
inputs.  These  two  logie  gates  were  then  used  to  produee  an  arrangement  of 
eomputational  eomponents  eonsisting  of  a  NOR  gate,  an  AND  gate,  and  an  OR  gate  as 
illustrated  in  Figure  10.  Figure  1 1  shows  the  equivalent  arrangement  made  up  of  NOT 
and  NAND  gates.  This  arrangement  of  eomputational  eomponents  was  analyzed  using 
the  time  eourse  simulation  obtaining  the  desired  output  eorresponding  to  speeifie  inputs 
after  manipulation  of  the  Vmax  value  and  Km  value  of  the  Miehaelis-Menten  rate  laws 
inherent  to  the  arrangement.  Onee  again  it  was  demonstrated  that  eomputational  biology 
tools  like  those  found  in  Bio-SPICE  ean  be  used  to  explore  how  biology  ean  play  a  role 
in  eomputing. 
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Figure  8.  Illustration  of  a  NAND  gate  with  its  truth  table. 
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Figure  9.  NAND  gate  in  JDesigner. 
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Figure  10.  Combination  of  a  NOR  gate,  an  AND  gate,  and  an  OR  gate. 
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Figure  11.  Another  way  to  represent  the  function  illustrated  in  Figure  10  using  a 
combination  of  logic  gates  using  only  NOT  and  NAND  gates 
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Figure  12.  JDesigner  representation  of  the  logic  gates  shown  in  Figure  11. 


Bio-SPICE  7.0 

Evaluation  of  Bio-SPICE  continued  over  time  at  different  levels  of  intensity.  Problems 
were  brought  to  the  attention  of  the  appropriate  individuals  or  teams.  The  in-house 
experience  was  valuable  in  providing  feedback  to  the  University  of  Texas  team  since  they 
wrote  a  tutorial  for  first  time  users  and  SRI  International  because  they  were  preparing  the 
software  for  life  after  DARPA  on  SourcePorge.net.  As  the  BioCOMP  Program  winds 
down  Bio-SPICE  continues  to  be  refined  with  more  focus  on  getting  better 
documentation  and  making  the  software  more  robust.  Originally  set  up  to  run  on  Einux 
and  Windows  platforms,  the  software  can  now  be  run  with  Mac  OS.  The  workflow  is 
now  portable  across  platforms,  but  now  a  new  problem  has  arisen. 

As  the  Air  Porce  information  assurance  policies  are  being  standardized,  some  issues 
have  come  to  light  that  were  not  a  problem  with  previous  versions  of  Bio-SPICE.  In  the 
APRE/IP  computing  environment,  restrictions  on  those  who  have  administrative 
privileges  now  impede  researchers  from  trying  to  troubleshoot  problems  when  working 
with  an  evolving  piece  of  software.  Pixes  from  tool  developers  under  contract  cannot  be 
installed  and  exercised  without  an  administrator.  This  introduces  another  barrier  to 
learning  and  understanding  the  nature  of  a  problem  by  reducing  the  opportunity  to  get 
hands-on  experience.  During  evaluations  of  Bio-SPICE  4.0  and  6.0  several  changes  to 
the  software  were  made  each  day  for  several  days  as  attempts  were  made  to  get  the 
software  running  correctly.  Some  software  that  was  previously  running  will  not  run 
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under  Bio-SPICE  7.0  because  the  user  does  not  now  have  the  correct  permission. 
Permissions  can  only  be  changed  by  the  administrator  who  has  to  be  called  back  after 
solving  one  problem,  then  leaves  to  help  others,  only  to  be  called  back  for  the  next 
problem  which  the  user  didn’t  know  about  because  the  previous  problem  prevented  the 
software  from  running  to  the  point  of  the  next  permission  issue.  Furthermore,  Bio-SPICE 
was  set  up  to  be  open  source  software  which  now  requires  special  permission  to  install 
since  it  is  non-standard  software  for  the  Air  Force.  There  is  currently  no  set  time  frame 
for  granting  or  denying  the  use  of  open  source  software  so  the  true  impact  on  trying  to 
use  the  software  and  meeting  project  milestones  is  not  known.  In  the  time  of  shrinking 
budgets  the  use  of  open  source  software,  which  has  no  purchase  price  or  license  fees, 
provides  a  cost  effective  means  to  explore  how  different  software  tools  and  techniques 
may  or  may  not  be  used  to  develop  new  technologies.  Commercial  software  can  be 
expensive,  especially  when  dealing  with  some  of  the  electronic  design  automation  tools 
used  for  computer  hardware  that  would  not  be  used  every  day  or  by  a  large  number  of 
researchers.  Furthermore,  it  may  only  take  twenty  minutes  of  trying  to  use  a  tool  to 
realize  it  will  not  be  helpful.  The  potential  exists  for  wasting  resources  on  commercial 
software  found  to  be  inadequate  for  a  research  and  development  project  if  open  source 
software  cannot  be  readily  used  and  evaluated.  Significant  manpower  may  be  expended 
on  the  paperwork  to  install  a  non-standard  piece  of  software  only  to  find  out  after  a  few 
minutes  that  it  is  not  adequate  for  the  task  at  hand.  Currently  AFRF/IF  is  examining 
alternative  arrangements  with  Air  Force  Information  Assurance  authorities  to  try  and 
alleviate  these  impediments  to  efficient  R&D  software  evaluation  and  development.  No 
final  decision  has  been  made  at  this  point  on  these  alternatives. 


5.0  Future  Challenges  for  the  Development  of  Computing  Hybrids 

In  order  for  the  concept  of  computing  hybrids  to  be  realized  several  more  technology 
answers  are  needed  to  overcome  the  difficult  task  of  integrating  new  science  with  proven 
technology.  From  the  perspective  of  leveraging  biotechnology  one  author  stated  that, 
“There  is  no  longer  any  doubt  as  to  the  theoretical  potential  of  DNA  computing. ”[29]  The 
challenge  now  is  to  develop  a  practical  implementation  of  not  only  DNA  computing,  but 
other  concepts  determined  to  have  strong  potential. 

If  biology  is  to  provide  complementary  computing  technologies,  the  specific  process 
needs  to  be  understood.  The  appropriate  models  of  the  biological  process  targeted  for 
exploitation  must  be  created  and  validated.  It  was  demonstrated  that  there  are 
computational  tools  available  that  will  help  researchers  understand  biological  processes 
for  computing.  There  is  room  for  improvement  and  more  work  will  be  needed.  As  the 
appropriate  modeling  and  simulation  tools  and  techniques  are  established  for  a  particular 
process,  the  key  information  must  be  extracted  and  passed  on  to  the  next  level  of  the 
design  path.  At  this  point  it  is  not  clear  what  that  information  is  or  in  what  form  it  must 
take.  Those  familiar  with  current  state-of-the-art  electronic  design  automation  (EDA) 
would  be  comfortable  if  the  new  science  could  be  implemented  in  the  existing  tools,  but 
it  is  not  clear  if  this  is  possible  or  if  the  tools  are  up  to  the  task.  Will  a  paradigm  shift  in 
the  way  a  computer  system  is  designed  be  needed  to  exploit  biotechnology?  It  was  stated 
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that,  “Much  work  has  already  been  done  on  shrinking  the  level  of  eireuits  and  switches 
found  in  eonventional  silieon  arehiteeture  down  to  the  atomie  level,  using  moleeular 
wires,  switehes,  and  the  Coulomb  effeet  and  ‘nano-islands’  to  allow  the  eneoding  of  a  bit 
in  the  presence  or  absenee  of  one  electron... But  we  should  remind  ourselves  again  that 
this  is  not  how  biological  organisms  process  information.”  [48]  Other  researchers  share 
this  opinion  and  have  begun  looking  at  how  to  model  the  future  systems,  but  we  are  far 
from  a  standardized  proeess  that  is  applieable  to  implementing  a  broad  range  of  eoneepts. 
Furthermore,  there  is  a  need  to  establish  the  right  metrics  to  gauge  performance 
eharaeteristies  of  the  new  eomputing  eoneepts  whieh  also  may  require  a  shift  from  the 
eurrent  eoneepts.  This  gets  even  more  eomplieated  if  one  has  to  aeeount  for  quantum 
effeets  along  with  other  issues  related  to  nanoteehnology.  Ideally  it  would  be  nice  to 
have  a  software  tool  for  the  design  of  hybrid  arehitectures  that  was  an  expert  system  that 
eould  address  a  mix  of  general  purpose  silieon  eomputing  teehnology,  nanoteehnology, 
bioteehnology  and  quantum  seienees.  A  user  would  sit  down  at  a  workstation,  sketeh  a 
eoneept  for  a  hybrid  eomputing  coneept  pulling  in  the  eomponents  from  a  very  large  cell 
library,  and  let  the  software  optimize  the  design.  While  much  more  work  is  needed 
before  we  will  eome  elose  to  the  ideal  eapability,  some  researeh  and  development  activity 
on  a  portion  of  this  concept  is  already  underway.  [3  8]  What  is  needed  now  is  to  seleet  a 
eouple  of  alternative  eomputing  eoneepts,  and  examine  the  proeess  for  building  a  hybrid 
arehiteeture  with  them.  The  objective  would  be  to  determine  how  well  the  existing 
software  tools  and  teehniques  ean  be  utilized  to  identify  the  teehnology  gaps  against 
whieh  to  foeus  future  research  and  development  aetivity  that  would  faeilitate  the 
maturation  of  the  teehnology. 


6.0  Concluding  Remarks 

This  projeet  provided  the  opportunity  to  gain  hands  on  experienee  to  eomplement  the 
knowledge  gained  from  partieipation  in  various  DARPA  Programs  that  involved 
bioteehnology  along  with  related  modeling  and  simulation  eapabilities.  It  established  a 
foundation  on  which  new  researeh  projeets  ean  be  built  to  take  bioteehnology  from  the 
laboratory  into  the  field  as  advaneed  hybrid  information  systems.  Much  effort  was  spent 
exploring  published  researeh  to  understand  what  others  have  aeeomplished,  problems 
they  eneountered,  the  potential  of  the  bioteehnology  to  advanee  eomputing  teehnology, 
and  to  identify  potential  paths  on  whieh  to  focus  future  researeh.  Various  eomputational 
biology  tools  were  also  examined  and  found  to  have  the  potential  to  help  develop 
bioteehnology  for  future  eomputing  arehiteetures.  Key  to  the  suceessful  maturation  of 
new  teehnology  is  having  the  proper  modeling  and  simulation  tools  and  teehniques  to 
integrate  the  technology  into  future  systems.  Computational  biology  tools  will  play  an 
important  part  in  expanding  the  role  biotechnology  plays  in  computing.  The  next  step  is 
to  seleet  a  few  alternative  eomputing  eoneepts  and  examine  how  to  integrate  them  into  a 
hybrid  arehiteeture,  defining  a  proeess  for  maturation  of  the  teehnology  for  praetieal 
applieation. 
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Appendix  A  -  Snapshot  of  Computational  Biology  Tools 


Below  is  a  snapshot  of  computational  biology  tools  encountered  while  participating  in 
DARPA’s  BioCOMP  Program.  The  tools,  like  the  Bio-SPICE  environment  itself 
continued  to  evolve. 


Tool  Name 
BioGrid 

BioMat  Bridge 

BioNetS 


BioSenS 


BioSmokey 

BioSpreadsheet 

BioWarehouse 

Biowarehouse2SBML 

CellX 


Summary 

BioGrid  distributes  simulation  and  analysis  jobs  across  a  cluster  of  computers  by 
creating  individual  SBML  files  for  each  model  parameter,  simulating  each  model, 
and  post-processing  the  output.  On  completion,  BioGrid  creates  a  web  page  with 
the  results  and  links  to  relevant  models  and  files  created. 

BioMat  Bridge  integrates  Matlab  into  the  Bio-SPICE  Dashboard,  enabling  a 
workflow  to  enter  and  exit  Matlab ’s  computation  engine,  permitting  users  to  utilize 
any  functionality  supported  by  Matlab  seamlessly  within  the  Dashboard. 

BioNetS  simulates  stochastic  models  of  biochemical  networks  efficiently  and 
accurately.  Each  chemical  species  can  be  specified  as  discrete  or  continuous,  with 
discrete  variables  simulated  with  the  Gillespie  algorithm,  and  continuous  variables 
simulated  with  chemical  Eangevin  equations,  or  stochastic  differential  equations. 
The  multiple  simulation  methods  allows  the  use  to  incorporate  the  appropriate 
method. 

BioSens  provides  a  sensitivity  analysis  toolkit  to  investigate  system  output  changes 
with  respect  to  parameter  variations.  Sensitivity  coefficients  of  the  model  are 
calculated  using  finite  difference  approximations,  allowing  the  user  to  compute  the 
system  sensitivity  to  model  parameters  and  initial  conditions.  In  addition,  Eisher 
Information  Matrix  based  sensitivity  measures  provide  a  consolidation  of  the 
information  in  dynamical  sensitivities  and  allow  easy  comparisons  of  the 
dependence  of  the  overall  system  behavior  with  respect  to  the  parameters. 
BioSmokey  provides  an  easy  well  to  manipulate  time  series  data,  with  functions 
that  include;  column  count,  tab  delimited  conversion,  column  deletion,  get  column, 
get  rows,  row  count,  and  sample. 

BioSpreadsheet  allows  users  to  easily  create  new  SBML  models  of  biochemical 
processes  or  edit  an  existing  SBML  model. 

BioWarehouse  supplies  a  toolkit  for  constructing  a  warehouse  of  bioinformatics 
databases,  including  loaders  for  databases  such  as  SWISS-PROT,  KEGG,  and 
NCBI  Taxonomy. 

Biowarehouse2SBML  extracts  stoichiometric  and  reversibility  conditions  for  all 
reactions  in  a  pathway  in  the  BioWarehouse,  producing  an  SBML  model  suitable 
for  metabolic  flux  analysis. 

CellX  simulates  multi-scale  cell  models  while  accounting  for  reaction  and  transport 
processes  as  well  as  the  attendant  intracellular  gradients  of  composition.  The 
simulations  are  performed  on  multiple  dimensions,  for  example,  reaction-transport 
equations  are  solved  along  fibrils  (1-D),  on  membranes  (2-D)  and  within  bulk 
media  (3-D),  all  simultaneously  and  with  full  coupling  to  accounts  for  molecular 
exchange  between  these  domains. 
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Charon  models  and  analyzes  hierarchieal  hybrid  bioehemical  systems  symbolically 
by  state-space  exploration  techniques.  This  symbolic  analysis  allows  users  to 
determine  the  feasibility  of  certain  model  behaviors,  steady-state  characteristics, 
etc. 

Clone  Updater  updates  gene  and  EST  clone  annotations  while  identifying  and 
eliminating  redundant  clones. 

ESS  performs  stochastic  simulations  using  an  optimized  version  of  Gillespie’s 
exact  stochastic  simulation  of  coupled  chemical  species  simulator. 

Eluxor  Computational  Analyzer  performs  flux  predictions  resulting  from  specific 
reactions,  selected  by  the  user,  having  their  fluxes  limited  or  completely  disabled. 
Eluxor  Spreadsheet  provides  a  spreadsheet-like  interface  to  allow  user  to  specify 
nutrient  conditions,  external  metabolites,  and  gene  knockouts,  and  to  display  the 
results  of  various  kinds  of  flux  predictions. 

GeneCite  permits  users  to  specify  sophisticated  sets  of  queries  and  generates  a  table 
of  the  number  of  citations  found  for  each  query.  The  table  can  be  presented  as  a 
web  page  or  in  standard  spreadsheet  format,  allowing  the  user  to  view  the  full 
output  of  only  those  queries  that  generate  an  interesting  number  of  citations. 

Genescreen  processes  gene  expression  data  with  a  collection  of  computational 
statistic  routines  to  extract  significant  gene  association  patterns. 

Geneways  allows  access  to  the  Geneways  Database,  which  contains  literature  from 
over  fifty  full  text  journals.  Users  can  add  additional  information  culled  from 
literature  to  an  existing  biochemical  model. 

Homologue  Einder  searches  for  homologous  genes  or  gene  products  for  each  gene 
or  gene  product  in  a  pathway  provided  by  the  user.  This  results  in  a  theoretical 
pathway  in  the  target  organism  with  found  homologues. 

Hybrid  performs  parameter  identification  of  biological  systems  through  using 
piecewise  affine  hybrid  automata. 

JDesigner  provides  a  visual  biochemical  network  simulation  tool  that  is  part  of  the 
Systems  Biology  Workbench,  permitting  users  to  graphically  specify  the  model, 
derive  the  representative  set  of  differential  equations  automatically  and  generate  a 
solution  simulatable  file  (JDesigner  itself  has  no  simulation  capability). 

JigCell  Tools  includes  several  capabilities  in  one  package.  The  Model  Builder 
builds  and  edits  SBML  models  using  a  spreadsheet  interface,  reducing  models  by 
finding  conservation  relations  in  the  equations.  The  Run  Manager  defines  an 
ensemble  of  runs  that  specify  how  to  simulate  SBME  models  with  various  changes 
to  parameter  and  initial  condition  values.  XPP  provides  access  to  Bard 
Ermentrout’s  XPP  simulator.  Biopack  provides  access  to  the  numerical  integration 
routines  of  ESODAR.  The  Comparator  executes  an  ensemble  of  runs,  applies  data 
transformations  to  the  simulation  output,  and  determines  how  well  the  model 
matches  the  collected  experimental  results  it  is  attempting  to  reproduce.  The 
Parameter  Estimator  fits  model  parameters  by  applying  local  and  global  optimizers 
to  find  values  that  bring  the  model  and  experimental  results  into  close  agreement. 
Compare!  allows  the  user  to  visualize  the  change  in  the  fit  of  a  model  to 
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experimental  results  as  a  model  is  developed  over  time.  The  Projeet  Manager  stores 
and  organizes  data  files  used  by  other  tools  into  logically  related  projects. 

KCA  provides  a  spatial  whole-cell  or  multi-cell  modeling  capability.  Taking  into 
account  a  network  of  biochemical  processes,  intracellular  architecture,  membrane 
transport  properties,  initial  cell  state  and  conditions  in  the  extracellular  system, 
Karyote  Cell  Analyzer  simulates  the  time  course  of  all  chemical  species  in  all 
cellular  compartments. 

KAGAN  generates  detailed  biochemical  information  about  gene  control  networks 
by  analyzing  raw  time  series  microarray  data.  KAGAN  can  also  use  timeseries  data 
integrated  with  a  subset  of  a  cell  biomic  simulator  to  perform  cell  simulations 
despite  model  incompleteness. 

IcDNA  estimates  confidence  intervals  for  mRNA  expression  levels  in  microarray 
experiments,  including  elimination  of  extreme  outliers,  quality  filtering, 
normalization  of  the  log  10  signal  intensity  ratios,  and  assessment  of  expression 
levels. 

MetaCluster  performs  cluster  analysis  with  several  algorithms,  including  the 
metacluster  algorithm,  which  identifies  trends  insensitive  to  the  clustering  method 
used,  implying  a  higher  confidence  in  the  clustering  analysis  results. 

MetaTool  derives  conclusions  about  the  pathway  structure  of  a  metabolic  network 
from  the  stochiometric  reaction  equations  and  information  about  reversibility  and 
irreversibility  of  enzymes. 

MIAME  Spice  packages  raw  and  normalized  datafiles  from  a  set  of  related 
microarray  experiments,  saving  all  associated  data  from  an  experiment  (or  set  of 
experiments)  into  one  archive  file.  Users  can  also  enter  experimental  annotations, 
array  design  information,  and  array  design  files. 

Monod  supports  collaborative  work  through  a  web-based  environment,  with  tools 
for  text-based  data  mining,  note  annotation,  literature  annotation,  user  workspace 
with  selective  sharing  properties,  and  model  editor. 

NCA  uses  available  connectivity  information  between  genes  and  transcriptional 
factors  and  gene  expression  level  time  course  data  (obtainable  through  DNA 
microarray  experiments)  to  estimate  parameters  and  infer  a  gene  transcriptional 
network  through  a  Matlab  analysis  routine. 

NYUMAD  facilitates  storage  of  microarray  data  and  simulation  time  series  data  in 
a  relational  data  base  management  system,  including  front-end  capabilities  for  data 
presentation  and  maintenance. 

The  NYU  Simulation  Database  (NYUSIM),  a  subset  of  NYUMAD,  stores  time 
series  data  from  simulations,  allowing  manipulation  and  cataloging  of  these  traces. 

OctaveBridge  provides  an  interface  to  a  free  scripting  language  and  simulation  tool 
that  can  be  used  as  an  alternative  to  Matlab  for  time  series  data. 

PAINT  allows  researchers  to  visualize,  identify,  and  analyze  gene  regulatory 
regions,  and  test  for  statistical  significance  to  rank  the  likelihood  of  the 
involvement  of  individual  transcription  factors  in  the  biology  under  study.  The 
results  are  written  in  suitable  formats  for  further  analysis  using  modeling  and 
simulation  tools. 
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Pathway  Screen  generates  a  the  pathways  that  a  list  of  genes  are  ineorporated  in 
and  a  link  to  a  pieture  of  these  pathways. 

Pathway  Builder  provides  software  tools  that  ean  be  used  to  graphieally  represent  a 
biological  pathway,  formally  define  eonceptual  models  of  proeesses  in  the 
pathway,  inelude  mathematieal  representations  of  those  models,  and  produee 
representations  of  these  models  exeeutable  in  Matlab  or  SBML  eompliant 
simulators. 

SAL  allows  highly  automated  abstract  modeling  of  biological  pathway  networks  to 
aeeelerate  understanding.  SAL  uses  modern  automated  symbolie  logie  to  represent 
biologieal  systems,  allowing  biologists  to  understand  eomplex  biologieal  systems, 
make  hypotheses,  and  refine  models. 

SBW  Optimizer  provides  an  interfaee  between  Matlab  and  the  Systems  Biology 
Workbeneh  (SBW),  allowing  users  to  aeeess  existing  SBW  modules  or  ereate  new 
modules. 

Sensitivity  Analyzer  performs  a  response  surfaee  analysis  of  a  bioehemieal  model. 
The  user  selects  features  of  the  model  behavior  and  the  tool  performs  an  efficient 
simulation  of  the  system  under  different  parameter  assumptions  to  better 
understand  the  dependenee  of  the  seleeted  feature  values  on  the  parameters. 
Simpathiea  simulates  a  bioehemieal  model  with  differential  algebraie  deseription, 
or  ordinary  differential  equations.  The  model  is  entered  using  a  library  of  template 
reaetions.  Simpathiea  also  offers  an  analysis  baekend  based  on  a  temporal  logie 
model  eheeker  eapable  of  analyzing  several  traees  at  a  time.  The  analyzer  provides 
feedbaek  to  the  user  about  particular  events  observed  in  the  behavior  of  the  system. 
SOS  Tools  solves  feasibility  or  optimization  sum  of  squares  problems  with  a  third- 
party  Matlab  toolbox.  The  solution  is  arrived  at  by  performing  sum  of  squares 
deeomposition  for  multivariate  polynomials,  whieh  is  effieiently  eomputed  with 
semi-definite  programming. 

see  searehes  for  graphieal  motifs  conserved  aeross  pathways  to  better  understand 
feedbaek.  The  program  searehes  for  strongly  eonneeted  components,  two 
eomponents  for  whieh  eaeh  eomponent  is  found  to  be  strongly  eonneeted  to  the 
other,  indieating  that  the  two  eomponents  are  very  likely  to  belong  to  a  eommon 
function  and  control  structure. 
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